SEQUENCE LISTING 



(1) GENERAIi INFORMATION: 



(i) APPLICANT: TAKAHASHI , Toiiru 

SERIZAWA, Nobufusa 
KOISHI, Ryuta 
KAWASHIMA, Iclliro 

(ii) TITLE OF INVENTION: EXPRESSION SYSTEMS UTILIZING 

AUTOLYZING FUSION PROTEINS 
AND A NOVEL REDUCING POLYPEPTIDE 

(iii) NUMBER OF SEQUENCES: 19 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Frishhauf, Holtz, Goodman, Lamger & Chick, P.C. 

(B) STREET: 767 Tliird Avenue-25tli Floor 

(C) CITY: New York 

(D) STATE: New York 

(E) COUNTRY: United States 

(F) ZIP: 10017-2023 

^ (v) COMPUTER READABLE FORM: 

^ (A) MEDIUM TYPE: Floppy disk 

^ij (B) COMPUTER: IBM PC compatible 

ffi (C) OPERATING SYSTEM: PC -DOS /MS-- DOS 

^ (D) SOFTWARE: Patentin Release #1.24 

(vi) CURRENT APPLICATION DATA: 
4:! (A) APPLICATION NUMBER: US 08/500,63 5 

4^ (B) FILING DATE: ll-JUL-1995 

SI (C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: JP 6-161053 
4'^ (B) FILING DATE: 13-JUL-1994 

in (vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: JP 6-218392 
H (B) FILING DATE: 13-SEP-1994 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: JP 6-303809 

(B) FILING DATE: 07-DEC-1994 

(viii) ATTORNEY /AGENT INFORMATION: 

(A) NAME: Goodman, Herbert 

(B) REGISTRATION NUMBER: 17081 

(C) REFERENCE/DOCKET NUMBER: 950376/HG 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (212) 319-4900 

(B) TELEFAX: (212) 319-5101 

(C) TELEX: 236268 

(2) INFORMATION FOR SEQ ID NO : 1 : 

(i) SEQUENCE CHARACTERISTICS: 
V (A) LENGTH: 132 0 base pairs 

(B) TYPE: nucleic "acid 

(C) STRANDEDNESS ; double 



( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: CDNA CO mRNA 

(iii) HYPOTHETICAL: N 

(iv) ANTI-SENSE: N 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Clover Yellow Vein Virus 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..1320 
(D) OTHER INFORMATION: 

(ix) FEATURE: 

(A) NAME/KEY: matjeptide 

(B) LOCATION: 10.. 1311 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 

AAG TTC CAA GGG AAA ACT AAG AGA ACA AGA CAA AAG TTG AAG TTC AGA 48 

Lys Phe Gin Gly Lys Ser Lys Arg Thr Arg Gin Lys Leu Lys Phe Arg 

15 10 15 

^ GCG GCA AGA GAC ATG AAG GAT CGT TAT GAA GTG CAT GCC GAT GAG GGG 96 

+* Ala Ala Arg Asp Met Lys Asp Arg Tyr Glu Val His Ala Asp Glu Gly 

f\l 20 25 30 

ACT TTA GTG GAA AAT TTT GGA ACT CGT TAT TCA AAG AAA GGC AAG ACA 144 
r^- Thr Leu Val Glu Asn Phe Gly Thr Arg Tyr Ser Lys Lys Gly Lys Thr 
3 5 40 45 

CI AAA GGT ACT GTT GTG GGT TTG GGT GCA AAA ACA AGA CGG TTC ACT AAC 192 

Lys Gly Thr Val Val Gly Leu Gly Ala Lys Thr Arg Arg Phe Thr Asn 
I\ 50 55 60 



ATG TAT GGT TTT GAC CCC ACG GAG TAT TCA TTT GCT AGG TAT CTT GAT 240 
Met Tyr Gly Phe Aso Pro Thr Glu Tyr Ser Phe Ala Arg Tyr Leu Asp 
65 * 70 75 80 

CCA ATC ACG GGT GCA ACA TTG GAT GAA ACC CCA ATT CAC AAT GTA AAT 288 
Pro lie Thr Gly Ala Thr Leu Asp Glu Thr Pro lie His Asn Val Asn 

85 90 95 

TTG GTT GCT GAG CAT TTT GGC GAC ATA AGG CTT GAT ATG GTT GAC AAG 33 6 

Leu Val Ala Glu His Phe Gly Astd lie Arg Leu Asp Met Val Aso Lys 
100 105 110 

GAG TTA CTT GAC AAA CAG CAC TTA TAC CTC AAG AGA CCA ATA GAA TGT 3 84 

Glu Leu Leu Asp Lys Gin His Leu Tyr Leu Lys Arg Pro lie Glu Cys 
115 120 125 

TAC TTT GTA AAG GAT GCT GGT CAG AAG GTG ATG AGG ATT GAT CTA ACA 432 
Tyr Phe Val Lys Asia Ala Gly Gin Lys Val Met Arg lie Asp Leu Thr 
130 135 140 

CCC CAC AAC CCA TTG TTG GCA AGC GAT GTT AGC ACA ACC AT.A. ATG GGT 480 
Pro*. His Asn Pro Leu Leu Ala Ser Asp Val Ser Thr Thr lie Met Gly 
145 150 155 160 



TAT CCT GAG AGA GAA GGT GAA CTC CGT CAA ACT GGA AAG GCA AGG TTA 528 
Tyr Pro Glu Arg Glu Gly Glu Leu Arg Gin Thr Gly hys Ala Arg Leu 

GTC GAG CCA TCA* GAG TTG CCC GCG CGG AAT GAG GAT ATT GAT GCA GAG 576 
Val Asp Pro Ser Glu Leu Pro Ala Arg Asn Glu Asp lie Asp Ala Glu 
180 185 130 

TTT GAG AGT CTA AAT CGC ATA AGT GGT TTG CGC GAC TAT AAT CCC ATT 624 
Phe Glu Ser Leu Asn Arg lie Ser Gly Leu Arg Asp Tyr Asn Pro lie 
X95 200 205 

TCA CAA AAT GTT TGC TTG CTA ACA AAT GAG TCA GAA GGC CAT AGA GAG 672 
Ser Gin Asn Val Cys Leu Leu Thr Asn Glu Ser Glu Gly His Arg Glu 
210 215 220 

AAG ATG TTT GGA ATT GGA TAT GGT TCA GTG ATC ATT ACA AAT CAA CAT 720 
Lys Met Phe Gly He Gly Tyr Gly Ser Val lie He Thr Asn Gin His 
225 230 235 240 

CTG TTC AGA AGG AAT AAT GGG GAG TTA TCA ATT CAA TCC AAG CAT GGC 763 
Leu Phe Arg Arg Asn Asn Gly Glu Leu Ser He Gin Ser Lys His Gly 
245 250 255 

TAC TTC AGA TGC CGC AAC ACC ACA AGC TTG AAG ATG CTG CCT TTG GAG 816 
Tyr Phe Arg Cys Arg Asn Thr Thr Ser Leu Lys Met Leu Pro Leu Glu 
260 265 270 

GGA CAT GAC ATT TTG TTG ATT CAG TTA CCA AGG GAC TTT CCA GTG TTT 864 
Glv His Asp He Leu Leu He Gin Leu Pro Arg Asp Phe Pro Val Phe 
^ 275 280 285 

CCA CAA AAG ATT CGC TTT AGG GAG CCA AGA GTG GAT GAC AAA ATT GTT 912 
Pro Gin Lys He Arg Phe Arg Glu Pro Arg Val Asp Asp Lys He Val 
290 295 300 

TTG GTC AGC ACA AAT TTC CAG GAA AAG AGT TCC TCG AGC ACG GTC TCA 
Leu Val Ser Thr Asn Phe Gin Glu Lys Ser Ser Ser Ser Thr Val Ser 
305 310 315 320 

GAG TCC AGT AAC ATT TCA AGA GTG CAG TCA GCC AAT TTC TAC AAG CAT 
Glu Ser Ser Asn He Ser Arg Val Gin Ser Ala Asn Phe Tyr Lys His 
325 330 335 

TGG ATC TCA ACA GTA GCA GGA CAC TGT GGA AAC CCT ATG GTT TCG ACT 1056 
Tro He Ser Thr Val Ala Gly His Cys Gly Asn Pro Met Val Ser Thr 
340 345 350 

AAA GAT GGA TTT ATT GTA GGT ATC CAC AGT CTT GCT TCA TTG ACA GGC 1104 
Lys Asp Gly Phe He Val Gly He His Ser Leu Ala Ser Leu Thr Gly 
355 360 365 

GAC GTT AAC ATC TTC ACA AGC TTT CCG CCG CAG TTT GAG AAC AAA TAT 1152 
Asp Val Asn He Phe Thr Ser Phe Pro Pro Gin Phe Glu Asn Lys Tyr 
370 375 380 

CTA CAG AAG CTC AGT GAA CAC ACA TGG TGT AGT GGA TGG AAA CTA AAT 1200 
Leu Gin Lys Leu Ser Glu His Thr Trp Cys Ser Gly Trp Lys Leu Asn 
385 390 395 400 

CTT GGA AAG ATT AGT TGG GGT GGA ATC AAC ATT GTG GAG GAT GCA CCT 1248 
Leut, Gly Lys He Ser Trp Gly Gly He Asn He Val Glu Asp Ala Pro 
405 410 415 



960 



1008 



GAA GAG CCC TTT ATA ACA TCC AAG ATG GCA AGC CTT CTT ACT GAT TTG 1296 
Slu III pro pS He Thr Ser Lys Met Ala Ser Leu Leu Ser Asp Leu 
420 

AAT TGT TCA TTC CAA GCA AGT GCG 1320 
Asn Cvs Ser Phe Gin Ala Ser Ala 
435 440 



(2) INFORMATION FOR SEQ ID NO : 2 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 440 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Clover Yellow Vein Virus 

(ix) FEATURE: 

(A) NAME /KEY: mat^peptlde 

(B) LOCATION: 4.. 437 
(D) OTHER INFORMATION: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 

LYS Phe Gin Gly Lys Ser Lys Arg Thr Arg Gin Lys Leu Lys Phe Arg 
1-5 10 1^ 

Ala Ala Arg Asp Met Lys Asp Arg Tyr Glu Val His Ala Asp Glu Gly 
20 25 30 

Thr Leu val Glu Asn Phe Gly Thr Arg Tyr Ser Lys Lys Gly Lys Thr 
35 40 45 

Lys Gly Thr Val Val Gly Leu Gly Ala Lys Thr Arg Arg Phe Thr Asn 
50 55 60 

Met Tyr Gly Phe Asp Pro Thr Glu Tyr Ser Phe Ala Arg Tyr Leu Asp 
65 70 75 80 

Pro He Thr Gly Ala Thr Leu Asp Glu Thr Pro He His Asn Val Asn 
85 50 95 

Leu val Ala Glu His Phe Gly Asp He Arg Leu Asp Met Val Asp Lys 
100 105 110 

Glu Leu Leu Asp Lys Gin His Leu Tyr Leu Lys Arg Pro He Glu Cys 
115 120 125 

Tvr Phe Val Lys Asp Ala Gly Gin Lys Val Met Arg He Asp Leu Thr 
130 135 140 

Pro His Asn Pro Leu Leu Ala Ser Asp Val Ser Thr Thr He Met Gly 
145 150 155 160 

Tyr Pro Glu Arg Glu Gly Glu Leu Arg Gin Thr Gly Lys Ala Arg Leu 
165 170 175 

Val' Aso P^o Ser Glu Leu Pro Ala Arg Asn Glu Asp He. Asp Ala Glu 
^ * 180 135 190 



Phe Glu Ser Leu Asn Arg He Ser Gly Leu Arg Asp Tyr Asn Pro He 

Ser Gin Asn Val Cys Leu Leu Thr Asn Glu Ser Glu Gly His Arg Glu 
210 215 220 

Lys Mec Phe Gly He Gly Tyr Gly Ser Val lie He Thr Asn Gin His 
225 230 235 

Leu Phe Arg Arg Asn Asn Gly Glu Leu Ser He Gin Ser Lys His Gly 
245 250 -433 

Tyr Phe Arg Cys Arg Asn Thr Thr Ser Leu Lys Met Leu Pro Leu Glu 
260 265 

Gly His ASP He Leu Leu He Gin Leu Pro Arg Asp Phe Pro Val Phe 
275 280 285 

Pro Gin Lys He Arg Phe Arg Glu Pro Arg Val Asp Asp Lys He Val 
290 295 300 

Leu val ser Thr Asn Phe Gin Glu Lys Ser Ser Ser Ser Thr Val Ser 
305 310 315 320 

Glu Ser Ser Asn He Ser Arg Val Gin Ser Ala Asn Phe Tyr Lys His 
325 330 -5-33 

Trp He Ser Thr val Ala Gly His Cys Gly Asn Pro Mec Val Ser Thr 
340 345 350 

Lys Asp Gly Phe He Val Gly He His Ser Leu Ala Ser Leu Thr Gly 



355 



Asp val Asn He Phe Thr Ser Phe Pro Pro Gin Phe Glu Asn Lys Tyr 
370 375 380 

Leu Gin Lys Leu Ser Glu His Thr Trp Cys Ser Gly Trp Lys Leu Asn 
385 390 395 400 

Leu Gly Lys He Ser Trp Gly Gly He Asn He Val Glu Asp Ala Pro 
405 410 

Glu Glu Pro Phe He Thr Ser Lys Met Ala Ser Leu Leu Ser Asp Leu 
420 425 430 

Asn Cys Ser Phe Gin Ala Ser Ala 
435 440 

( 2 ) INFORMATION FOR SEQ ID NO : 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other nucleic acid, synthetic DNA 
(iii) HYPOTHETICAL: N 
(iv) ANTI-SENSE: N 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 
GTCCATGGGG AAAAGTAAGA GAACA 
(2) INFORMATION FOR SEQ ID NO : 4 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: bclier nucleic acid, synthetic DNA 
Ciii) HYPOTHETICAL: N 
(iv) ANTI-SENSE: N 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 : 
ACTCTGAGAC CGTGCTCGAG 
(2) INFORMATION FOR SEQ ID NO : 5 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: o tiler nucleic acid, syntlietic DNA 
(iii) HYPOTHETICAL: N 
(iv) ANTI-SENSE: N 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 : 
AGGAAAAGAG TTCCTCGAGC 
(2) INFORMATION FOR SEQ ID NO : 6 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: Other nucleic acid, synthetic DNA 
(iii) HYPOTHETICAL: N 
(iv) ANTI-SENSE: N 



V (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 : 
AATTGTTCAT TCCAAGCACC TGGGCCACCA CCTGGC 



(2) INFORMATION FOR SEQ ID NO : 7 : 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs , 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: o titer nucleic acid, syntiiietic DNA 
(iii) HYPOTHETICAL: N 
(iv) ANTI-SENSE: N 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 : 
GCCAGGTGGT GGCCCAGGTG CTTGGAATGA ACAATT 
(2) INFORMATION FOR SEQ ID NO : 8 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other nucleic acid, synthetic DNA 
(iii) HYPOTHETICAL: N 
(iv) ANTI-SENSE: N 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 
TTGTCAGCAC ACCTGGGAGC TGTAGAGCTC 
(2) INFORMATION FOR SEQ ID NO : 9 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 sunino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(iii) HYPOTHETICAL: N 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 : 

Ala Pro Gly Pro Pro Pro Gly 
1 5 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 7 amino acids 



(B) TYPE: aunino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(iii) HYPOTHETICAL: N 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 10 : 

Pro Gly Pro Pro Pro Gly Pro 
1 5 

(2) INFOBMATION FOR SEQ ID NO : 11 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1650 base paxrs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNES S : dOTlbl e 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: N 
(iv) ANTI-SENSE: N 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 
(H) CELL LINE: KM-102 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: KM31-7 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

( B ) LOCATION : 1 . . 1 647 
(D) OTHER INFORMATION: 

(ix) FEATXmE: 

(A) NAME/KEY: mat_peptide 

(B) LOCATION: 70,. 1647 
(D) OTHER INFORMATION: 

(ix) FEATURE: 

(A) NAME/KEY: sig_pept2.de 

(B) LOCATION: 1 . . 69 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

ATG TCA TGT GAG GAC GGT CGG GCC CTG GAA GGA ACG CTC TCG GAA TTG 
Me? Ser Cys Glu Asp Gly Arg Ala Leu Glu Gly Thr Leu Ser Glu Leu 
-23 -20 -^5 

GCC GCG GAA ACC GAT CTG CCC GTT GTG TTT GTG AAA CAG AGA AAG ATA 
Sa III Glu Thr ASP Leu Pro Val Val Phe Val Lys Gin Arg Lys He 
-5 ^ ^ 

nrr rrc CAT GGT CCA ACC TTG AAG GCT TAT CAG GAG GGC AGA CTT CAA 
fl^ lly kIs oTy Pro ?hr Leu Lys Ala Tyr Gin Glu Gly Arg Leu Gin 
10 15 20 25 



AAG CTA CTA AAA ATG AAC GGC CCT GAA GAT CTT CCC AAG TCC TAT GAG 192 
Lys Leu Leu Lys Mec Asn Gly Pro Glu Asp Leu Pro Lys Ser Tyr Asp 
30 35 40 

TAT GAG CTT ATC ATC ATT GGA GGT GGC TCA GGA GGT CTG GCA GCT GCT 240 
Tyr Asp Leu lie He He Gly Gly Gly Ser Gly Gly Leu Ala Ala Ala 
45 50 55 

AAG GAG GCA GCC CAA TAT GGC AAG AAG GTG ATG GTC CTG GAC TTT GTC 288 
Lys Glu Ala Ala Gin Tyr Gly Lys Lys Val Met Val Leu Asp Phe Val 
60 65 70 

ACT CCC ACC CCT CTT GGA ACT AGA TGG GGT CTT GGA GGA ACA TGT GTG 33 6 

Thr Pro Thr Pro Leu Gly Thr Arg Trp Gly Leu Gly Gly Thr Cys Val 
75 80 85- 

AAT GTG GGT TGC ATA CCT AAA AAA CTG ATG CAT CAA GCA GCT TTG TTA 384 
Asn Val Gly Cys He Pro Lys Lys Leu Met: His Gin Ala Ala Leu Leu 
90 95 100 105 

GGA CAA GCC CTG CAA GAC TCT CGA AAT TAT GGA TGG AAA GTC GAG GAG 432 
GlY Gin Ala Leu Gin Asp Ser Arg Asn Tyr Gly Trp Lys Val Glu Glu 
110 115 120 

ACA GTT AAG CAT GAT TGG GAC AGA ATG ATA GAA GCT GTA CAG AAT CAC 480 
Thr Val Lys His Asp Trp Asp Arg Met He Glu Ala Val Gin Asn His 
Q 125 130 135 

S ATT GGC TCT TTG AAT TGG GGC TAC CGA GTA GCT CTG CGG GAG AAA AAA 528 

^ He Gly Ser Leu Asn Trp Gly Tyr Arg Val Ala Leu Arg Glu Lys Lys 

# 140 -145 150 

m GTC GTC TAT GAG AAT GCT TAT GGG CAA TTT ATT GGT CCT CAC AGG ATT 576 

val Val Tyr Glu Asn Ala Tyr Gly Gin Phe He Gly Pro His Arg He 
155 160 165 

AAG GCA ACA AAT AAT AAA GGC AAA GAA AAA ATT TAT TCA GCA GAG AGA 624 
□ Lys Ala Thr Asn Asn Lys Gly Lys Glu Lys He Tyr Ser Ala Glu Arg 

J 170 175 180 135 

rj.rj.rj. ^rj.^ ;^rprp j^Qfj. QQrj. p^QJ^ QQJ^ CGT TAC TTG GGC ATC CCT GGT 672 

Phe Leu He Ala Thr Gly Glu Arg Pro Arg Tyr Leu Gly He Pro Gly 
O 190 195 200 

GAC AAA GAA TAC TGC ATC AGC AGT GAT GAT CTT TTC TCC TTG CCT TAC 720 
Aso Lys Glu Tyr Cys He Ser Ser Asp Asp Leu Phe Ser Leu Pro Tyr 
205 210 215 

TGC CCG GGT AAG ACC CTG GTT GTT GGA GCA TCC TAT GTC GCT TTG GAG 7 68 

Cys Pro Gly Lys Thr Leu Val Val Gly Ala Ser Tyr Val Ala Leu Glu 
220 225 230 

TGC GCT GGA TTT CTT GCT GGT ATT GGT TTA GAC GTC ACT GTT ATG GTT 816 
Cys Ala Gly Phe Leu Ala Gly He Gly Leu Asp Val Thr Val Met Val 
235 240 245 

AGG TCC ATT CTT CTT AGA GGA TTT GAC CAG GAC ATG GCC AAC AAA ATT 864 
Arg Ser He Leu Leu Arg Gly Phe Asp Gin Asp Met Ala Asn Lys He 
250 255 260 265 

GGT GAA CAC ATG GAA GAA CAT GGC ATC AAG TTT ATA AGA CAG TTC GTA 912 
Gly Glu His Met Glu Glu His Gly He Lys Phe He Arg Gin Phe Val 
270 275 280 



CCA ATT AAA GTT GAA CAA ATT GAA GCA GGG ACA CCA GGC CGA CTC AGA 960 
Pro He Lvs Val Glu Gin He Glu Ala Gly Ttir Pro Gly Arg Leu Arg 
285 290 295 

GTA GTA GCT CAG TCC ACC AAT AGT GAG GAA ATC ATT GAA GGA GAA TAT 1008 
Val Val Ala Gin Ser Tlir Asn Ser Glu Glu He He Glu Gly Glu Tyr 
300 305 310 

AAT ACG GTG ATG CTG GCA ATA GGA AGA GAT GCT TGC ACA AGA AAA ATT 1056 
Asn Thr Val Met Leu Ala He Gly Arg Asp Ala Cys Thr Arg Lys He 
315 320 325 

GGC TTA GAA ACC GTA GGG GTG AAG ATA AAT GAA AAG ACT GGA AAA ATA 1104 
Glv Leu Glu Tlir Val Gly Val Lys He Asn Glu Lys Thr Gly Lys He 
330 335 340 345 

CCT GTC ACA GAT GAA GAA CAG ACC AAT GTG CCT TAC ATC TAT GCC ATT 1152 
Pro Val Thr Asp Glu Glu Gin Thr Asn Val Pro Tyr He Tyr Ala He 
350 355 360 

GGC GAT ATA TTG GAG GAT AAG GTG GAG CTC ACC CCA GTT GCA ATC CAG 12 00 

Gly Asp He Leu Glu Asp Lys Val Glu Leu Thr Pro Val Ala He Gin 
365 370 375 

GCA GGA AGA TTG CTG GCT CAG AGG CTC TAT GCA GGT TCC ACT GTC AAG 1248 
Ala Gly Arg Leu Leu Ala Gin Arg Leu Tyr Ala Gly Ser Thr Val Lys 
380 385 390 



TGT GAC TAT GAA AAT GTT CCA ACC ACT GTA TTT ACT CCT TTG GAA TAT 12 9 6 

71 Cvs ASP Tyr Glu Asn Val Pro Thr Thr Val Phe Thr Pro Leu Glu Tyr 

395 400 405 

M GGT GCT TGT GGC CTT TCT GAG GAG AAA GCT GTG GAG AAG TTT GGG GAA 13 44 

-'^ Glv Ala Cys Gly Leu Ser Glu Glu Lys Ala Val Glu Lys Phe Gly Glu 

410 415 420 425 

GAA AAT ATT GAG GTT TAC CAT AGT TAC TTT TGG CCA TTG GAA TGG ACG 13 92 

^ Glu Asn He Glu Val Tyr His Ser Tyr Phe Trp Pro Leu Glu Trp Thr 

1^1 430 435 440 

^ ATT CCG TCA AGA GAT AAC AAC AAA TGT TAT GCA AAA ATA ATC TGT AAT 1440 

^ He Pro Ser Arg Aso Asn Asn Lys Cys Tyr Ala Lys He He Cys Asn 

Ui 445 * 450 455 

ACT AAA GAC AAT GAA CGT GTT GTG GGC TTT CAC GTA CTG GGT CCA AAT 1488 
Thr Lys Asp Asn Glu Arg Val Val Gly Phe His Val Leu Gly Pro Asn 
460 465 470 

GCT GGA GAA GTT ACA CAA GGC TTT GCA GCT GCG CTC AAA TGT GGA CTG 153 6 

Ala Gly Glu Val Thr Gin Gly Phe Ala Ala Ala Leu Lys Cys Gly Leu 
475 480 485 

ACC AAA AAG CAG CTG GAC AGC ACA ATT GGA ATC CAC CCT GTC TGT GCA 1584 
Thr Lys Lys Gin Leu Aso Ser Thr He Gly He His Pro Val Cys Ala 
490 495 500 505 

GAG GTA TTC ACA ACA TTG TCT GTG ACC AAG CGC TCT GGG GCA AGC ATC 163 2 

Glu Val Phe Thr Thr Leu Ser Val Thr Lys Arg Ser Gly Ala Ser He 
510 515 520 

CTC CAG GCT GGC TGC TGA 1650 
Leu^ Gin Ala Gly Cys 
525 



(2) INFORMATION FOR SEQ ID NO : 12 : 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 549 amino acids 

(B) TYPE: amino 'acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 12 : 

Met Ser Cys Glu Asp Gly Arg Ala Leu Glu Gly Thr Leu Ser Glu Leu 
-23 -20 -15 -10 

Ala Ala Glu Thr Asp Leu Pro Val Val Phe Val Lys Gin Arg Lys lie 
-5 1 5 

Glv Glv His Gly Pro Ttu: Leu Lys Ala Tyr Gin Glu Gly Arg Leu Gin 
10 15 20 25 

Lvs Leu Leu Lys Met Asn Gly Pro Glu Asp Leu Pro Lys Ser Tyr Asp 
30 35 40 

Tyr Asp Leu lie lie lie Gly Gly Gly Ser Gly Gly Leu Ala Ala Ala 
45 50 55 

Lys Glu Ala Ala Gin Tyr Gly Lys Lys Val Met Val Leu Asp Pile Val 
60 65 70 

Thr Pro Thr Pro Leu Gly Thr Arg Trp Gly Leu Gly Gly Thr Cys Val 
75 80 85 

Asn Val Gly Cys lie Pro Lys Lys Leu Met His Gin Ala Ala Leu Leu 
90 95 100 105 

Gly Gin Ala Leu Gin Asp Ser Arg Asn Tyr Gly Trp Lys Val Glu Glu 
110 115 120 

Thr Val Lys His Asp Trp Asp Arg Met lie Glu Ala Val Gin Asn His 
125 130 135 

lie Gly Ser Leu Asn Trp Gly Tyr Arg Val Ala Leu Arg Glu Lys Lys 
140 145 150 

Val Val Tyr Glu Asn Ala Tyr Gly Gin Phe lie Gly Pro His Arg lie 
155 160 165 

Lys Ala Thr Asn Asn Lys Gly Lys Glu Lys lie Tyr Ser Ala Glu Arg 
170 175 180 135 

Phe Leu lie Ala Thr Gly Glu Arg Pro Arg Tyr Leu Gly lie Pro Gly 
190 195 200 

Asp Lys Glu Tyr Cys lie Ser Ser Asp Asp Leu Phe Ser Leu Pro Tyr 
205 210 215 

Cys Pro Gly Lys Thr Leu Val Val Gly Ala Ser Tyr Val Ala Leu Glu 
220 225 230 

Cys Ala Gly Phe Leu Ala Gly lie Gly Leu Asp Val Thr Val Met Val 
' 235 240 245 



Arg Ser lie Leu Leu Arg Gly Phe Asp Gin Asp Met Ala Asn Lys He 
250 255 260 265 

Gly Glu His Met Glu Glu His Gly He Lys Phe lie Arg Gin Phe Val 
270 275 280 

Pro He Lys Val Glu Gin Xle Glu Ala Gly Thr Pro Gly Arg Leu Arg 
285 290 295 

Val Val Ala Gin Ser Thr Asn Ser Glu Glu He He Glu Gly Glu Tyr 
300 305 310 

Asn Thr Val Met Leu Ala He Gly Arg Asp Ala Cys Thr Arg Lys He 
315 320 325 

Glv Leu Glu Thr Val Gly Val Lys He Asn Glu Lys Thr Gly Lys He 
330 335 340 345 

Pro Val Thr Asp Glu Glu Gin Thr Asn Val Pro Tyr He Tyr Ala He 
350 355 360 

Glv Asp He Leu Glu Asp Lys Val Glu Leu Thr Pro Val Ala He Gin 
365 370 375 

Ala Gly Arg Leu Leu Ala Gin Arg Leu Tyr Ala Gly Ser Thr Val Lys 
380 385 390 

Cvs Asp Tyr Glu Asn Val Pro Thr Thr Val Phe Thr Pro Leu Glu Tyr 
395 400 405 

Gly Ala Cys Gly Leu Ser Glu Glu Lys Ala Val Glu Lys Phe Gly Glu 
410 415 420 425 

Glu Asn He Glu Val Tyr His Ser Tyr Phe Trp Pro Leu Glu Trp Thr 
430 435 440 

He Pro Ser Arg Asp Asn Asn Lys Cys Tyr Ala Lys He He Cys Asn 
445 450 455 

Thr Lys Asc Asn Glu Arg Val Val Gly Phe His Val Leu Gly Pro Asn 
460 465 470- 

Ala Gly Glu Val Thr Gin Gly Phe Ala Ala Ala Leu Lys Cys Gly Leu 
475 480 485 

Thr Lys Lys Gin Leu Asp Ser Thr He Gly He His Pro Val Cys Ala 
490 495 500 505 

Glu Val Phe Thr Thr Leu Ser Val Thr Lys Arg Ser Gly Ala Ser He 
510 515 520 

Leu Gin Ala Gly Cys 
525 

(2) INFORMATION FOR SEQ ID NO : 13 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other nucleic acid, synthetic DNA 



(iii) HYPOTHETICAL: N 
(iv) ANTI- SENSE; N 

(xi) SEQUENCE DESCRXPTION: SEQ ID NO : 13 : 
TAAATAAATA AATAA 

(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 66 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNES S : doub 1 e 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other nucleic acid, synthetic DNA 
(iii) HYPOTHETICAL: N 

(iv) ANTI-SENSE: N 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 14 : 
CTAGCGCTCT GGGGCAAGCA TCCTCCAGGC TGGCTGCCAC CACCACCACC ACCACTGATC 

TAGACT 

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: Other nucleic acid, synthetic DNA 

(iii) HYPOTHETICAL: N 
(iv) ANTI-SENSE: N 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
GGTCAGCACA AATTTCCA. 
(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid, synthetic DNA 
\iii) HYPOTHETICAL: N 



(iv) ANTI-SENS2: M 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
AAACACAACT TGGAATGAAC AATT 
(2) INFORMATION FOR SEQ ID NO : 17 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: oclier nucleic acid, synthetic DNA 
(iii) HYPOTHETICAL: N 
(iv) ANTI-SENSE: N 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 
TCATTCCAAG TTGTGTTTGT GAAA 
(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE; Other nucleic acid, synthetic DNA 
(iii) HYPOTHETICAL: N 
(iv) ANTI-SENSE: N 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 
CATAGGATGC TCCAACAA 

(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

Asa Cys Ser Phe Gin Xaa 
1 5 



